Main #11

clementchadebec · 2025-12-09T10:36:30Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

…huggingface#12259) docs: Fix VAE scale factor calculation

…follow up on huggingface#11873 (huggingface#12264) * propagate fixes from huggingface#11873 to flux script * propagate fixes from huggingface#11873 to flux script * propagate fixes from huggingface#11873 to flux script * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…ce#12266) update

…gingface#12236) * feat: try loading fa3 using kernels when available. * up * change to Hub. * up * up * up * switch env var. * up * up * up * up * up * up

* refresh * feedback * feedback * supported models * fix

* initial commit * update * updates * update * update * update * update * update * update * addressed PR comments * update * addressed PR comments * update * update * update * update * update * update * updates * update * update * addressed PR comments * updates * code formatting * update * addressed PR comments * addressed PR comments * addressed PR comments * addressed PR comments * fix docs and dependencies * fixed dependency test --------- Co-authored-by: Sayak Paul <[email protected]>

* feat: add a test for aot. * up

@tolgacangoz

* Add AttentionMixin to WanVACETransformer3DModel to enable methods like `set_attn_processor()`. * Import AttentionMixin in transformer_wan_vace.py Special thanks to @tolgacangoz 🙇‍♂️

Signed-off-by: co63oc <[email protected]>

init

init Co-authored-by: Sayak Paul <[email protected]>

* init * fix * feedback * feedback

* add qwen modular

* add qwen-image-cn-inpaint --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: yiyixuxu <[email protected]>

Co-authored-by: J石页 <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

* Update utils.py not perfect but works engine: https://github.com/calcuis/gguf-connector/blob/main/src/gguf_connector/quant2c.py inference example(s): https://github.com/calcuis/gguf-connector/blob/main/src/gguf_connector/k6.py https://github.com/calcuis/gguf-connector/blob/main/src/gguf_connector/k5.py gguf file sample(s): https://huggingface.co/calcuis/kontext-gguf/tree/main https://huggingface.co/calcuis/krea-gguf/tree/main * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…2290) adjust criteria for XPU Signed-off-by: Liu, Kaixuan <[email protected]> Co-authored-by: Aryan <[email protected]>

…ace#12283) * feat: support group offloading at the pipeline level. * add tests * up * [docs] Pipeline group offloading (huggingface#12286) init Co-authored-by: Sayak Paul <[email protected]> --------- Co-authored-by: Steven Liu <[email protected]>

fix flux modular pipelines for t2i and i2i

) * add * add a test

…ingface#12309) fix the device for textencoder

…mponents (huggingface#12234) * allow non list components_to_quantize. * up * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Steven Liu <[email protected]> * [docs] components_to_quantize (huggingface#12287) init Co-authored-by: Sayak Paul <[email protected]> --------- Co-authored-by: Steven Liu <[email protected]>

Co-authored-by: YiYi Xu <[email protected]>

…gface#12271) * deprecate slicing from flux pipeline. * propagate. * tiling * up * up

* Use SDP on BF16 in GPU/HPU migration Signed-off-by: Daniel Socek <[email protected]> * Formatting fix for enabling SDP with BF16 precision on HPU Signed-off-by: Daniel Socek <[email protected]> --------- Signed-off-by: Daniel Socek <[email protected]>

* support Wan2.2-VACE-Fun-A14B * support Wan2.2-VACE-Fun-A14B * support Wan2.2-VACE-Fun-A14B * Apply style fixes * test --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* fixed bug in defining embed dim * matched 1d temb process to 2d * Update src/diffusers/models/unets/unet_1d.py Co-authored-by: Dhruv Nair <[email protected]> --------- Co-authored-by: Dhruv Nair <[email protected]>

* Added LucyEditPipeline * add import & stype missing copied from * Fix example doc string --------- Co-authored-by: yiyixuxu <[email protected]>

* Update autoencoder_kl_wan.py When using the Wan2.2 VAE, the spatial compression ratio calculated here is incorrect. It should be 16 instead of 8. Pass it in directly via the config to ensure it’s correct here. * Update autoencoder_kl_wan.py

* fix hidream type hint * fix hunyuan-video type hint * fix many type hint * fix many type hint errors * fix many type hint errors * fix many type hint errors * make stype & make quality

* add ovis_image * fix code quality * optimize pipeline_ovis_image.py according to the feedbacks * optimize imports * add docs * make style * make style * add ovis to toctree * oops --------- Co-authored-by: YiYi Xu <[email protected]>

…g with empty dim. (huggingface#12770) * Refactor image padding logic to pervent zero tensor in transformer_z_image.py * Apply style fixes * Add more support to fix repeat bug on tpu devices. * Fix for dynamo compile error for multi if-branches. --------- Co-authored-by: Mingjia Li <[email protected]> Co-authored-by: Mingjia Li <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…orking properly (huggingface#12721) * Fixes huggingface#12673. Wrong default_stream is used. leading to wrong execution order when record_steram is enabled. * update * Update test --------- Co-authored-by: Sayak Paul <[email protected]>

…2765) * start varlen variants for attn backend kernels. * maybe unflatten heads. * updates * remove unused function. * doc * up

* remove attn_processors property * more * up * up more. * up * add AttentionMixin to AuraFlow. * up * up * up * up

* update * update * Revert "update" This reverts commit 7390638. * Revert "update" This reverts commit 21a03f9. * update * update * update * update * update

* add transformer pipeline first version --------- Co-authored-by: Álvaro Somoza <[email protected]> Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: Charles <[email protected]> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: dmitrienkoae <[email protected]> Co-authored-by: nvvaulin <[email protected]>

…12639) * Fix(peft): Re-apply group offloading after deleting adapters * Test: Add regression test for group offloading + delete_adapters * Test: Add assertions to verify output changes after deletion * Test: Add try/finally to clean up group offloading hooks --------- Co-authored-by: Sayak Paul <[email protected]>

fix hunuyanvideo 1.5 offloading tests.

…gingface#12741) * start zimage model tests. * up * up * up * up * up * up * up * up * up * up * up * up * Revert "up" This reverts commit bca3e27. * expand upon compilation failure reason. * Update tests/models/transformers/test_models_transformer_z_image.py Co-authored-by: dg845 <[email protected]> * reinitialize the padding tokens to ones to prevent NaN problems. * updates * up * skipping ZImage DiT tests * up * up --------- Co-authored-by: dg845 <[email protected]>

* Z-Image-Turbo `from_single_file` * compute_dtype * -device cast

…uggingface#12767) refactor: add type hints and update docstrings for UniPCMultistepScheduler parameters and methods.

…encode (huggingface#12753) fix spatial compression ratio compute error for AutoEncoderKLWan Co-authored-by: lirui.926 <[email protected]>

up Co-authored-by: Álvaro Somoza <[email protected]>

…mentation (huggingface#12791) fix timestepembeddings downscale_freq_shift to be consitant with Photoroom's original code

…ne layers (huggingface#12692) * fix: group offloading to support standalone computational layers in block-level offloading * test: for models with standalone and deeply nested layers in block-level offloading * feat: support for block-level offloading in group offloading config * fix: group offload block modules to AutoencoderKL and AutoencoderKLWan * fix: update group offloading tests to use AutoencoderKL and adjust input dimensions * refactor: streamline block offloading logic * Apply style fixes * update tests * update * fix for failing tests * clean up * revert to use skip_keys * clean up --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Dhruv Nair <[email protected]>

* initial * toctree * fix * apply review and fix * Update docs/source/en/api/pipelines/z_image.md Co-authored-by: Steven Liu <[email protected]> * Update docs/source/en/api/pipelines/z_image.md Co-authored-by: Steven Liu <[email protected]> * Update docs/source/en/api/pipelines/z_image.md Co-authored-by: Steven Liu <[email protected]> --------- Co-authored-by: Steven Liu <[email protected]>

up

…ggingface#12796) * feat: Add `flow_prediction` to `prediction_type`, introduce `use_flow_sigmas`, `flow_shift`, `use_dynamic_shifting`, and `time_shift_type` parameters, and refine type hints for various arguments. * style: reformat argument wrapping in `_convert_to_beta` and `index_for_timestep` method signatures.

* init taylor_seer cache * make compatible with any tuple size returned * use logger for printing, add warmup feature * still update in warmup steps * refractor, add docs * add configurable cache, skip compute module * allow special cache ids only * add stop_predicts (cooldown) * update docs * apply ruff * update to handle multple calls per timestep * refractor to use state manager * fix format & doc * chores: naming, remove redundancy * add docs * quality & style * fix taylor precision * Apply style fixes * add tests * Apply style fixes * Remove TaylorSeerCacheTesterMixin from flux2 tests * rename identifiers, use more expressive taylor predict loop * torch compile compatible * Apply style fixes * Update src/diffusers/hooks/taylorseer_cache.py Co-authored-by: Dhruv Nair <[email protected]> * update docs * make fix-copies * fix example usage. * remove tests on flux kontext --------- Co-authored-by: toilaluan <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Dhruv Nair <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

Update the naming Co-authored-by: Sayak Paul <[email protected]>

* add post init for safty checker Signed-off-by: jiqing-feng <[email protected]> * check transformers version before post init Signed-off-by: jiqing-feng <[email protected]> * Apply style fixes --------- Signed-off-by: jiqing-feng <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* support step-distilled * style

* Add ZImageImg2ImgPipeline Updated the pipeline structure to include ZImageImg2ImgPipeline alongside ZImagePipeline. Implemented the ZImageImg2ImgPipeline class for image-to-image transformations, including necessary methods for encoding prompts, preparing latents, and denoising. Enhanced the auto_pipeline to map the new ZImageImg2ImgPipeline for image generation tasks. Added unit tests for ZImageImg2ImgPipeline to ensure functionality and performance. Updated dummy objects to include ZImageImg2ImgPipeline for testing purposes. * Address review comments for ZImageImg2ImgPipeline - Add `# Copied from` annotations to encode_prompt and _encode_prompt - Add ZImagePipeline to auto_pipeline.py for AutoPipeline support * Add ZImage pipeline documentation --------- Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: Álvaro Somoza <[email protected]>

* Reimplement img2seq & seq2img in PRX to enable ONNX build without Col2Im (incompatible with TensorRT). * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sayak Paul <[email protected]>

…py (huggingface#12798) feat: add flow sigmas, dynamic shifting, and refine type hints in DPMSolverSinglestepScheduler

Men1scus and others added 30 commits September 1, 2025 16:34

[docs] Fix VAE scale factor calculation in distributed inference docs (…

9e4a75b

…huggingface#12259) docs: Fix VAE scale factor calculation

[CI] Remove big accelerator requirements from Quanto Tests (huggingfa…

bcd4d77

…ce#12266) update

[core] use kernels to support _flash_3_hub attention backend (hug…

130fd8d

…gingface#12236) * feat: try loading fa3 using kernels when available. * up * change to Hub. * up * up * up * switch env var. * up * up * up * up * up * up

[docs] AutoPipeline (huggingface#12160)

6549b04

* refresh * feedback * feedback * supported models * fix

[tests] feat: add AoT compilation tests (huggingface#12203)

ffc8c0c

* feat: add a test for aot. * up

Add AttentionMixin to WanVACETransformer3DModel (huggingface#12268)

6682956

* Add AttentionMixin to WanVACETransformer3DModel to enable methods like `set_attn_processor()`. * Import AttentionMixin in transformer_wan_vace.py Special thanks to @tolgacangoz 🙇‍♂️

fix some typos (huggingface#12265)

764b624

Signed-off-by: co63oc <[email protected]>

[docs] Sharing pipelines/models (huggingface#12280)

c2e5ece

init

[docs] Inference section cleanup (huggingface#12281)

32798bf

init Co-authored-by: Sayak Paul <[email protected]>

[docs] Models (huggingface#12248)

fc337d5

* init * fix * feedback * feedback

[Modular] Qwen (huggingface#12220)

f50b18e

* add qwen modular

Support ControlNet-Inpainting for Qwen-Image (huggingface#12301)

4e36bb0

* add qwen-image-cn-inpaint --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: yiyixuxu <[email protected]>

DeepSpeed adaption for flux-kontext (huggingface#12240)

c222570

Co-authored-by: J石页 <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

adjust criteria for marigold-intrinsics example on XPU (huggingface#1…

4067d6c

…2290) adjust criteria for XPU Signed-off-by: Liu, Kaixuan <[email protected]> Co-authored-by: Aryan <[email protected]>

[modular] fix flux modular pipelines for t2i and i2i (huggingface#12272)

f7b7945

fix flux modular pipelines for t2i and i2i

[feat] cache allocator warmup for from_single_model (huggingface#12305

9e7ae56

) * add * add a test

fix for the qwen controlnet pipeline - wrong device can be used (hugg…

e1b7f1f

…ingface#12309) fix the device for textencoder

Fix AttributeError of VisualClozeProcessor (huggingface#12121)

55f0b3d

Co-authored-by: YiYi Xu <[email protected]>

Deprecate slicing and tiling methods from DiffusionPipeline (huggin…

5e181ed

…gface#12271) * deprecate slicing from flux pipeline. * propagate. * tiling * up * up

Add Wan2.2 VACE - Fun (huggingface#12324)

b500140

* support Wan2.2-VACE-Fun-A14B * support Wan2.2-VACE-Fun-A14B * support Wan2.2-VACE-Fun-A14B * Apply style fixes * test --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

fixed bug in defining embed dim for UNet1D (huggingface#12111)

751e250

* fixed bug in defining embed dim * matched 1d temb process to 2d * Update src/diffusers/models/unets/unet_1d.py Co-authored-by: Dhruv Nair <[email protected]> --------- Co-authored-by: Dhruv Nair <[email protected]>

Added LucyEditPipeline (huggingface#12340)

8c72cd1

* Added LucyEditPipeline * add import & stype missing copied from * Fix example doc string --------- Co-authored-by: yiyixuxu <[email protected]>

Fix many type hint errors (huggingface#12289)

efb7a29

* fix hidream type hint * fix hunyuan-video type hint * fix many type hint * fix many type hint errors * fix many type hint errors * fix many type hint errors * make stype & make quality

DoctorKey and others added 30 commits December 2, 2025 11:48

Add support for Ovis-Image (huggingface#12740)

4f136f8

* add ovis_image * fix code quality * optimize pipeline_ovis_image.py according to the feedbacks * optimize imports * add docs * make style * make style * add ovis to toctree * oops --------- Co-authored-by: YiYi Xu <[email protected]>

[core] start varlen variants for attn backend kernels. (huggingface#1…

f48f9c2

…2765) * start varlen variants for attn backend kernels. * maybe unflatten heads. * updates * remove unused function. * doc * up

[core] reuse AttentionMixin for compatible classes (huggingface#12463)

759ea58

* remove attn_processors property * more * up * up more. * up * add AttentionMixin to AuraFlow. * up * up * up * up

Deprecate upcast_vae in SDXL based pipelines (huggingface#12619)

1908c47

* update * update * Revert "update" This reverts commit 7390638. * Revert "update" This reverts commit 21a03f9. * update * update * update * update * update

[tests] fix hunuyanvideo 1.5 offloading tests. (huggingface#12782)

d96cbac

fix hunuyanvideo 1.5 offloading tests.

Z-Image-Turbo from_single_file (huggingface#12756)

6028613

* Z-Image-Turbo `from_single_file` * compute_dtype * -device cast

Update attention_backends.md to format kernels (huggingface#12757)

c318686

Improve docstrings and type hints in scheduling_unipc_multistep.py (h…

2842c14

…uggingface#12767) refactor: add type hints and update docstrings for UniPCMultistepScheduler parameters and methods.

fix spatial compression ratio error for AutoEncoderKLWan doing tiled …

cd00ba6

…encode (huggingface#12753) fix spatial compression ratio compute error for AutoEncoderKLWan Co-authored-by: lirui.926 <[email protected]>

[lora] support more ZImage LoRAs (huggingface#12790)

7de51b8

up Co-authored-by: Álvaro Somoza <[email protected]>

PRX Set downscale_freq_shift to 0 for consistency with internal imple…

8d415a6

…mentation (huggingface#12791) fix timestepembeddings downscale_freq_shift to be consitant with Photoroom's original code

move kandisnky docs.

bb9e713

[docs] minor fixes to kandinsky docs (huggingface#12797)

8430ac2

up

Update the TensorRT-ModelOPT to Nvidia-ModelOPT (huggingface#12793)

5a74319

Update the naming Co-authored-by: Sayak Paul <[email protected]>

[HunyuanVideo1.5] support step-distilled (huggingface#12802)

671149e

* support step-distilled * style

Improve docstrings and type hints in scheduling_dpmsolver_singlestep.…

54fa074

…py (huggingface#12798) feat: add flow sigmas, dynamic shifting, and refine type hints in DPMSolverSinglestepScheduler

Merge branch 'main' into main

c346366

Merge branch 'clipdrop-main' into main

e664396

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Main #11

Main #11

Uh oh!

clementchadebec commented Dec 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Main #11

Are you sure you want to change the base?

Main #11

Uh oh!

Conversation

clementchadebec commented Dec 9, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants